WORLD n^TELLECrUAL PROPERTY ORGANIZATION 
International Bureau 




PCX 

INTERNATIONAL APPLICATION PUBUSHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Clas^cation ^ : 
C07K 1/113 //C12N 15/51 



Al 



(11) International PubUcation Number: WO 96/35709 

(43) Internationa] PubUcation Date: 1 4 November 1 996 ( 1 4. 1 1 .96) 



(21) International Applicaticm Number: PCr/US96/06388 

(22) International Filing Date: 9 May 1996 (09.05.96) 



(30) Priority Data: 
08/439,680 
08/571,643 



12 May 1995 (12.05.95) US 

13 December 1995 (13.12.95) US 



(71) Applicant: SCHERING CORPORATION [USAJS]; 2000 Gal- 

loping Hin Road, Kcnilwoith, NJ 07033-0530 (US). 

(72) Inventors: RAMANATHAN, Lata; 32 Valley Way, West 

Orange. NJ 07052 (US). WENDEL, Michelc; 7 Robin Road. 
Fanwood, NJ 07023 (US). 

(74) Agents: LUNN. Paul, G. ct al.; Schering-Plough Corporation, 
Patent Dept. K-6-1 1990, 2000 Galloping Hill Road, 
Kenilworth, NJ 07033-0530 (US). 



(81) Designated States: AL. AM, AU, AZ, BB, BG. BR, BY. CA, 
CN, CZ, EE. GE, HU, IS. JP, KG. KR. KZ. LK. LR, LT, 
LV, MD. MG. MK, MN. MX. NO. NZ, PL. RO. RU. SG. 
SI. SK. TJ. TM. TR. TT, UA. UZ. VN. ARIPO patent (KE. 
LS. MW. SD. SZ. UG), Eurasian patent (AM, AZ, BY. KG, 
KZ, MD, RU, TJ. TKf), European patent (AT. BE, CH, DE. 
DK.ES.F1.fr, GB, GR. IE. IT, LU, MC. NL. PT. SE). 
OAPI patent (BF, BJ. CF. CG. CI, CM. GA. GN. ML, MR. 
NE. SN. TD. TG). 



Published 

Wish international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Titie: METHOD FOR REFOLDING INSOLUBLE AGGREGATES OF HEPATITIS C VIRUS PROTEASE 
(57) Abstract 

A method for solubilizing and refolding insoluble aggregates of HCV protease. 




FOR THE PURPOSES OF INFORMATION ONLY 

Codes used to identify States party to the PCX on the front pages of pamphlets publishing international 
applications under the PCT, 



AM 


AnDcnia 


GB 


United Kingdom 


MW 


Malawi 


AT 


Austria 


GE 


Geofgia 


MX 


Mexico 


AU 


Australia 


GN 


Guinea 


NE 


Kiger 


BB 


Barbados 


GR 


Greece 


NL 


Noherlaiids 


BE 


Belgiuni 


HI) 


Hungary 


NO 


Norway 


BF 


Buriona Faso 


IE 


Ireland 


NZ 


New Zealand 


BG 


Bulgaria 


IT 


ka}y 


PL 


Poland 


BJ 


Benin 


JP 


Japan 


PT 


PoiTQgal 


BR 


Brazil 


KE 


Kenya 


RO 


Romania 


BY 


Belaitts 


KG 


Kyigystan 


RU 


Russian Federation 


CA 


Canttla 


KP 


Democratic People^s Republic 


SD 


Sudan 


CF 


CeotraJ African Republic 




.of Korea 


SE 


Sweden 


CC 


Congo 


KR 


Republic of Korea 


SG 


Singapore 


CH 


Switzerland 


KZ 


Kazakhstan 




Slovenia 


a 


CAte d' I voire 


U 


Liechtenstein 


SK 


Slovakia 


CM 


Cameroon * 


LK 


Sri Lanka 


SN 


Scftegal 


CN 


China 


LR 


Liberia 


SZ 


Swaziland 


cs 


Czechoslovakia 


LT 


Lithuania 


TD 


Chad 


cz 


Czech Republic 


LU 


Luxembourg 


TG 


Togo 


DE 


Germany 


LV 


Latvia 


TJ 


Tajiktstsn 


DK 


Denmailc 


MC 


Monaco 


TT 


Trinidad and Tobago 


EE 


Estonia 


MD 


Republic of Moldova 


UA 


Ukraine 


ES 


Spain 


MG 


Madagascar 


UG 


Uganda 


n 


Fin land 


ML 


Mali 


US 


United Stales of Americi 


FR 


France 


MN 


Mongolia 


UZ 


Uzbek btan 


GA 


Gabon 


MR 


Mauritania 


VN 


Vict Nam 



wo 96/35709 PCTAJS96/06388 



FOR REFOLD ING INSOLUBLE AGGREGATES OF 
HFPATTTISr VIRUS PROTEASE 



BACKGROUN n OF THF INVENTION 



10 

Hepatitis C virus (HCV) is considered to be the major etiological 
agent of non-A non-B (NANB) hepatitis, chronic liver disease, and 
hepatocellular carcinoma (HCC) aroimd the world. The viral infection 
accounts for greater than 90% of transfusion -associated hepatitis in U.S. 
1 5 and it is the predominant form of hepatitis in adults over 40 years of 
age. Almost all of the infections result in chronic hepatitis and nearly 
20% develop liver cirrhosis. 

The virus particle has not been identified due to the lack of an 
20 efficient in vitro replication system and the extremely low amount of 
HCV particles in infected liver tissues or blood. However, molecular 
cloning of the viral genome has been accomplished by isolating the 
messenger RNA (mRNA) from the serum of infected chimpanzees then 
cloned using recombinant methodologies. [Grakoui A. et al J. Virol 67: 
25 1385 - 1395 (1993)1 1* is now known that HCV contains a positive strand 
RNA genome comprising approximately 9400 nucleotides, whose 
organization is similar to that of flaviviruses and pestiviruses . The 
genome of HCV, like that of flavi- and pestiviruses, encodes a single 
large polyprotein of about 3000 amino adds which imdergoes proteolysis 
30 to form mature viral proteins in infected cells. 

Cell-free translation of the viral polyprotein and ceD culture 
expression studies have established that the HCV f)olyprotein is 
processed by cellular and viral proteases to produce the putative 
35 structural and nonstructural (NS) proteins. At least rune mature viral 
proteins are produced from the polyprotein by specific proteolysis. The 
order and nomenclature of the cleavage products are as follows: NH2-C- 
El-E2-NS2-NS3-NS4A-NS4B-NS5A-NS5B-COOH.(Fig 1). The three 
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amino terminal putative structural proteins, C (capsid). El, and E2 ( two 
envelope glycoproteins), are believed to be deaved by host signal 
peptidases of the endoplasmic reticulum (ER) . The host enzyme is also 
responsible for generating the amino terminus of NS2 . The proteolytic 

5 processing of the nonstructural proteins are carried out by the viral 
proteases: NS2-3 and NS3, contained within the viral p>olyprotein. The 
NS2-3 protease catalyzes the cleavage between NS2 and NS3. It is a 
metalloprotease and requires both NS2 and the protease domain of NS3. 
The NS3 protease catalyzes the rest of the deavages the substrates in the 

10 nonstructural part of the polyprotein. The NS3 protein contains 631 
amino add residues and is comprised of two enzymatic domains: the 
protease domain contained within amino acid residues 1-181 and a 
helicase ATPase domain contained within the rest of the protein. It is 
not known if the 70 kD NS3 protein is cleaved further in infected cells to 

1 5 separate the protease domain from the helicase domain, however, no 
deavage has been observed in cell culture expression studies. 



The NS3 protease is a member of the serine dass of enzymes. It 
contains His, Asp, and Ser as the catalytic triad, Ser being the active site 
20 residue. Mutation of the Ser residue abolishes the deavages at substrates 
NS3/4A, NS4A/4B, NS4B/5A, and NS5A/5B. The deavage between 
NS3 and NS4A is intramolecular, whereas the deavages at NS 4A/4B, 
4B/5A, 5A/5B sites occur in trans . 

25 Experiments ming transient expression of various forms of HCV 

NS polyproteins in mammalian cells have established that the NS3 
serine protease is necessary but not suffident for effident processing of 
all these deavages. Like flaviviruses, the HCV NS3 protease also 
requires a cofactor to catalyze some of these deavage reactions. In 

30 addition to the serine protease NS3, the NS4A protein is absolutely 

required for the deavage of the substrate at the 4B/5A site and increases 
the efficiency of deavage of the substrate between 5A/5B, and possibly 
4A/4B. 



35 Because the HCV NS3 protease deaves the non-structural HCV 

proteins which are necessary for the HCV replication, the NS3 protease 
can be a target for the development of therapeutic agents against the 
HCV virus. The gene encoding the HCV NS3 protein has been cloned 
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as disclosed in U.S. Patent No. 5371,017, however, not in a soluble active 
form. If the HCV protease is to be useful as a target in a screen to 
discover therapeutic agents, the protease must be produced in a soluble 
active form. Thus, there is a need for a soluble active form of the HCV 

5 protease which can be produced in large quantities to be used in high 
throughput screen to detect inhibitors of the protease and for structural 
studies. We have cloned and expressed the catalytic domain of NS3 
protease as a native protein and as fusion proteins in E. coli and in 
Yeast Fusion tags were used to facilitate purification and secretion into 

10 periplasmic space. All of these constructions resvdted in expression of 
NS3 protein only in insoluble form. Various attempts which include 
growing bacteria in different media and temperatures, expressing in 
different strains of E. coli failed to produce expression of soluble NS3. 
Thus, there is a need for a soluble active form of the HCV protease 

1 5 which can be used in a screen to test for potential therapeutic agents. 

Summary Of The Invention 

The present invention fills this need by providing for a process 
20 for producing soluble, proteolytically active, refolded HCV protease 
from insoluble, bacterially produced HCV protease aggregates. 
Insoluble, aggregates of HCV NS3 protease are extracted from bacteria 
producing said aggregates. The aggregates of protease are then 
solubilized in a buffer containing a denaturing reagent. The solubilized 
25 protease from are then placed in a buffer containing a reducing agent 
said buffer having an acidic pH. The denaturing reagent is then 
removed from the buffer under conditions wherein the buffer 
maintains an addic pH. The pH of the buffer containing the protease is 
then raised in a stepwise manner to a pH of about 7 - 8 so as to produce 
30 properly refolded soluble, active NS3 protease. 

In a preferred embodiment of the present invention, the 
insoluble protease is first extracted from the bacteria by homogenization 
or sonication of the bacteria. The aggregates containing the bacteria are 
35 then solubilized in a 5 M solution of guanidine hydrochloride (GuHCl). 
The NS3 protease is then purified from high molecular weight 
aggregates by size exclusion chromatography, as for example by applying 
the solution to a SEPHACRYL &-300 size exclusion gel chromatography. 
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Fractions containing the NS3 protease are collected and the solution 
comprised of 5 M solution of GuHCl is diluted to about 0.1 M GuHCl in 
a refolding buffer containing dithiothreitol and lauryl maltoside. 
The diluted solution- is then applied to a reverse phase chromatography 
column and pools containing the NS3 protease collected. The pH of the 
protease fractions is then raised in a stepwise manner to about 7.4 - 7.8 so 
as to produce properly refolded soluble, active NS3 protease, 

Brigf Pescription Of The Figures 

Figure 1 schematically depicts the HCV polyprotein. 

Figure 2 depicts the recombinant synthesis of plasmid pBJlOlS. 

1 5 Figure 3 depicts the recombinant synthesis of plasmid pTS56-9. 

Figure 4 depicts the recombinant synthesis of plasmid pT5His/HIV/183. 

pp^ailpd Pescripti nrf Of The Invention 

20 

The teachings of all references cited are incorporated herein in 
their entirety by reference. 

The amino add sequence of the NS3 protease catalytic domain is 
25 shown in SEQ ID NO: 1. Prior to the present invention the NS3 

protease could not be produced in a soluble form in sufficient quantities 
for extraction and purification. The present invention provides for a 
method to solubilize and refold bacterially produced soluble HCV 
protease. 

30 

According to the present invention, soluble HCV NS3 protease 
can be produced having the sequences shown in SEQ ID NO: 1 and SEQ 
ID NO: 4. The NS3 protease can also have a histidine tag fused to its 
amino acid terminus' for use in purifying the protein on a nickel (Ni^^) 
35 coated resin. See SEQ ID NO: 5. The protease is produced as insoluble 
aggregates, i.e. inclusion bodies, in bacteria such asE. colL 
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The insoluble HCV NS3 protease is first extracted from the 
bacteria by homogenization or sonication of the bacteria. The aggregates 
containing the bacteria are then solubilized in a solubilizing agent. 
Siiitable solubilizing agents are guanidine hydrochloride (GuHCl), urea 
5 and glutothiocyanate. Preferably the solubilizing agent is a 5 M solution 
of GuHCl. In a preferred embodiment, the solubilized NS3 protease is 
then purified from high molecular weight aggregates by size exclusion 
chromatography, as for example by applying the solution to a 
SEPHACRYL S-300 size exclusion gel column. Fractions containing the 

10 NS3 protease in the solubilizing agent are diluted in a refolding buffer 
containing a reducing agent. Examples of suitable reducing agents are 
dithiothreitol (DTT), dithioerythritol (DET) and ^-mercaptoethanol. 
The preferred refolding buffer contains about 10% DTT. The refolding 
buffer also preferably contains a non-ionic detergent. Examples of non- 

1 5 ionic detergents are lauryl maltoside, a polyoxyethylene ether such as 
TRITON X-100®, Nonidet a polyoxyethylene 9 -lauryl ether such 

as THESn^, (3-[(3-Cholamidopropyl)-dimethylammoruo]-l- 
propanesulfonate) (CHAPS), and octylglucoside. Preferably the 
insoluble aggragates of protease are solubilized in 5M GuHQ. Purified 

20 fractions from a size exclusion gel column are pooled and diluted to 

about 0.1 M GuHCl in a refolding buffer comprised of 10% dithiothreitol 
and 0.1% lauryl maltoside. The diluted solution is then applied to a 
reverse phase chromatography column and pools containing the NS3 
protease collected. The pH of the protease fractions is then raised in a 

25 stepwise manner to al>out 7-8, preferably 7.4 - 7.8, so as to produce 
properly refolded soluble, active NS3 protease. 

DNA encoding the NS3 protease of this invention can be 
prepared by chemical synthesis using the known nucleic acid 
sequence [Ratner et a/.. Nucleic Adds Res. 13:5007 (1985)] and 

30 standard methods such as the phosphoramidite solid support 

method of Matteucd etal.\J Am. Chem. Soc. 103:3185 (1981)] or the 
method of Yoo et al. Biol. Chem. 764:17076 (1989)]. See also Glide, 
Bernard R, and Pasternak, Molecular Biotechnology : pages 55 - 63, 
(ASM Press, Washington, D.C. 1994). The gene encoding the protease 

35 can also be obtained using the plasmid disdosed in Grakoui, A., 

Wychowski, C, Lin, C, Feinstone, S. M., and Rice, C. M., Expression 
and Identification of Hepatitis C Virus, polyprotein Cleavage 
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Products, /. Virol 67;1385-1395 (1993). Also, the nucleic acid encoding 
HCV protease can be isolated, amplified and cloned (from patients 
infected with the HCV virus). Furthermore, the HCV genome has 
been disclosed in PCT WO 89/04669 and are available from the 
5 American Type Culture Collection (ATCC), 12301 Parklawn Drive, 
Rockville, MD under ATCC accession no. 40394. 

Of course, because of the degeneracy of the genetic code, there 
are many functionally equivalent nucleic acid sequences ttiat can 
encode mature human HCV protease as defined herein. Such 
10 functionally equivalent sequences, which can readily be prepared 
using known methods such as chemical synthesis, PCR employing 
modified primers and site-directed mutagenesis, are within the scope 
of this invention. 



15 As used herein, the term "transformed bacteria" means bacteria 

that have been genetically engineered to produce a mammalian protein. 
Such genetic engineering usually entails the introduction of an 
expression vector into a bacterium. The expression vector is capable of 
autonomous replication and protein expression relative to genes in the 

20 bacterial genome. Construction of bacterial expression is well known in 
the art, provided the nucleotide sequence encoding a desired protein is 
known or otherwise available. For example, DeBoer in U.S. Pat. No. 
4,551,433 discloses promoters for use in bacterial expression vectors; 
Goeddel et al in U.S. Pat No. 4,601,980 and Riggs, in US. Pat. No. 

25 4,431,739 disclose the production of mammalian proteins by E. coli 

expression systems; and Riggs supra, Ferretti et al Proc, Natl Acad, Sri. 
83:599 (1986), Sproat rf al. Nucleic Acid Research 13:2959 (1985) and 
Mullenbach et al, /. Biol Chem 261:719 (1986) disclose how to construct 
synthetic genes for expression in bacteria. Many bacterial expression 

30 vectors are available commercially and through the American Type 
Culture Collection (ATCC), Rockville, Maryland. 

Insertion of DNA encoding human HCV protease into a 
vector is easily accomplished when the termini of both the DNA and 
35 the vector comprise the same restriction site. If this is not the case, it 
may be necessary to modify the termini of the DNA and /or vector by 
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digesting back single-stranded DNA overhangs generated by 
restriction endonuclease cleavage to produce blunt ends, or to 
achieve the same result by filling in the single-stranded termini with 
an appropriate DNA polymerase. Alternatively, any site desired may 
5 be produced by ligating nucleotide sequences (linkers) onto the 
termini. Such lir\kers may comprise specific oligonucleotide 
sequences that define desired restriction sites. The cleaved vector 
and the DNA fragments may also be modified if required by 
homop>olymeric tailing. 

1 0 Many E. co//-compatible expression vectors can be used to 

produce soluble HCV NS3 protease of the present invention, 
including but not limited to vectors containing bacterial or 
bacteriophage promoters such as the Tac, Lac, Trp, Lac IIV5, 1 Pr arid 1 
Pl promoters. Preferably, a vector selected will have expression 

1 5 control sequences that permit regulation of the rate of HCV protease 
expression. Then, HCV protease production can be regulated to 
avoid overproduction that could prove toxic to the host cells. Most 
preferred is a vector comprising, from 5* to 3' (upstream to 
downstream)/ a Tac promoter, a lac N repressor gene and DNA 

20 encoding mature hixman HCV protease. The vectors chosen for use 
in this invention may also encode secretory leaders such as the 
ompA or protein A leader, as long as such leaders are cleaved during 
post-translational processing to produce mature HCV protease or if 
the leaders are not cleaved, the leaders do not interfere with the 

25 enzymatic activity of the protease. 

Fusion peptides will typically be made by either recombinant 
nucleic acid methods or by synthetic polypeptide methods. Techniques 
for nucleic add manipulation and expression are described generally, 

30 e.g., in Sambrook, et al. (1989) Molecular Cloning: A Laboratory Manual 
(2d ed.), vols. 1-3, Cold Spring Harbor Laboratory; and Ausubel, et al. 
(eds.) (1993) Current Protocols in Molecular Biology, Greene and Wiley, 
NY. Techniques for synthesis of polypeptides are described, e.g., in 
Merrifield (1963) /. Amer. Chem. Soc. «5:2149-2156; Merrifield (1986) 

35 Science 232: 341-347; and Stewart et al (1984)., ""Solid Phase Peptide 
Synthesis" (2nd Edition), Pierce Chemical Co., Rockford, XL.; and . 
Atherton, et al. (1989) Solid Phase Peptide Synthesis: A Practical 



wo 96/35709 



PCTAJS96/06388 



-8- 



10 



15 



20 



25 



30 



Approach, IRL Press, Oxford; and Grant (1992) Synthetic Peptides: A 
User's Guide, W.H. Freeman, NY. 

The smaller peptides such as the NS4A cofactor, SEQ ID NO: 6, 7 
and 8, and the substrates 5A/5B, SEQ ID NO: 5, and 4B/5A, SEQ ID NO: 9 
can be synthesized by a suitable method such as by exclusive solid phase 
S)mthesis, partial solid phase methods, fragment conderisation or 
classical solution synthesis. The p>olypeptides are preferably prepared by 
solid phase peptide synthesis as described by Merrifield, J. Am. Chem. 
Soc. 55:2149 (1963). The synthesis is carried out with amino adds that 
are protected at the alpha-amino terminus. Trifunctional amino adds 
with labile side-chains are also protected with suitable groups to prevent 
undesired chemical reactions from occurring during the assembly of the 
polypeptides. The alpha-amino protecting group is selectively removed 
to allow subsequent reaction to take place at the amino-terminus. The 
conditions for the removal of the alpha-amino protecting group do not 
remove the side-chain protecting groups. 

The alpha-amino protecting groups are those known to be useful 
in the art of stepwise polypeptide synthesis. Induded are acyl type 
protecting groups {e,g., formyl, trifluoroacetyl, acetyl), aryl type 
protecting groups {e.g. , biotinyl), aromatic urethane type protecting 
groups [e.g., benzyloxycarbonyl (Cbz), substituted benzyloxycarbonyl and 
9-fiuorenylmethyloxy-carbonyl (Fmoc)], aliphatic urethane protecting 
groups [e.g., t-butyloxycarbonyl (tBoc), isopropyloxycarbonyl, 
cydohexyloxycarbonyl] and alkyl type protecting groups (e.g., benzyl, 
triphenyimethyl). The preferred protecting groups are tBoc and Fmoc, 
thus the peptides are said to be synthesized by tBoc and Fmoc chemistry, 
respectively. 

The side-chain protecting groups selected must remain intact 
during coupling and not be removed during the deprotection of the 
amino-terminus protecting group or during coupling conditions. 
The side-chain protecting groups must also be removable upon the 
completion of synthesis, using reaction conditions that will not alter 
the finished polypeptide. In tBoc chemistry, the side-chain protecting 
groups for trifunctional amino acids are mostly benzyl based. In 
Fmoc chemistry, they are mostly tert.-butyl or trityl based. 




wo 96/35709 PCTaJS96/a6388 

-9- 

In tBoc chemistry, the preferred side-chain protecting groups 
are tosyl for Arg, cydohexyl for Asp, 4-methylben2yl (and 
acetamidomethyl) for Cys, benzyl for Glu, Ser and Thr, 
5 benzyloxymethyl (and dinitrophenyl) for His, 2-Cl-ben2yloxycarbonyl 
for Lys, formyl for Trp and 2-bromobenzyl for Tyr. In Fmoc 
chemistry, the preferred side-chain protecting groups are 2,2,5,7,8- 
pentamethylchroman-6-sulfonyl (Pmc) or 2,2,4,6,7- 
pentamethyldihydrobenzofuran-S-sulfonyl (PbO for Arg, trityl for 
1 0 Asn, Cys, Gin and His, tert. butyl for Asp, Glu, Ser, Thr and Tyr, tBoc 
for Lys and Trp. 

For the synthesis of phosphopeptides, either direct or post- 
assembly incorporation of the phosphate group is used. In the direct 

15 incorporation strategy, the phosphate group on Ser, Thr or Tyr may 
be protected by methyl, benzyl or tert.butyl in Fmoc chemistry or by 
methyl, benzyl or phenyl in tBoc chemistry. Direct incorporation of 
phosphotyrosine without phosphate protection can also be used in 
Fmoc chemistry. In the post-assembly incorporation strategy, the 

20 improtected hydroxyl group of Ser, Thr or Tyr was derivatized on 
solid phase with di-tert.butyl-, dibenzyl- or dimethyl-N,N'- 
diisopropylphosphoramidite £ind then oxidized by 
tert.butylhydroperoxide. 

25 Solid phase synthesis is usually carried out from the carboxyl- 

terminus by coupling the alpha-amino protected (side-chain 
protected) amino add to a suitable solid support. An ester linkage is 
formed when the attachment is made to a chloromethyl, chlortrityl 
or hydroxymethyl resin, and the resulting polypeptide wUl have a 

30 free carboxyl group at the C-terminus. Alternatively, when an amide 
resin such as benzhydrylamine or p-methylbenzhydrylamine resin 
(for tBoc chemistry) and Rink amide or PAL resin (for Fmoc 
chemistry) is used, an amide bond is formed and the resulting 
polypeptide, will have a carboxamide group at the C-terminu5. These 

35 resins, whether polystyrene- or polyamide-based or 

polyethyleneglycol-grafted, with or without a handle or linker, with 
or without the first amino add attached, are commerdally available, 
and their preparations have been described by Stewart et al (1984)., 
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""Solid Phase Peptide Synthesis'" (2nd Edition), Pierce Chemical Co., 
Rockford, IL.; and Bayer & Rapp (1986) Chem. Pept. Prot. 3, 3; and 
Atherton, et al. (198*9) Solid Phase Peptide Synthesis: A Practical 
Approach, IRL Press, Oxford. 

5 

The C-tenniiul amino acid, protected at the side-chain if 
necessary and at the alpha-amino group, is attached to a 
hydroxylmethyl resin using various activating agents including 
dicydohexylcarbodiimide (DCC), N,N*-diisopropylcarbodiimide 

1 0 DIPCDI) and carbonyldiimidazole (CDI). It can be attached to 
chloromethyl or chlorotrityl resin directly in its cesium 
tetramethylammonium salt form or in the presence of triethylamine 
(TEA) or diisopropylethylamine (DIEA). First amino acid 
attachment to an amide resin is the same as amide bond formation 

1 5 during coupling reactioi^ 

Following the, attachment to the resin support, the alpha- 
amiiK) protecting group is removed using various reagents 
depending on the protecting chemistry (e.g. , tBoc, Fmoc). The extent 
20 of Fmoc removal can be monitored at 300-320 nm or by a 

conductivity cell. After removal of the alpha-amino protecting 
group, the remaining protected amino acids are coupled stepwise in 
the required order to obtain the desired sequence. 

25 Various activating agents can be used for the coupling 

reactions including DCC, DIPCDI, 2-chloro-13-din\ethylimidium 
hexafluorophosphate (CIP), benzotriazol-l-yl-oxy-tris- 
(dimethylamino)-phosphonium hexafluorophosphate (BOP) and its 
pyrrolidine analog (PyBOP), bromo-tris-pyrrolidino-phosphonium 

30 hexafluorophosphate (PyBroP), O -(benzotria2ol-l-yl)-l,133- 
tetramethyluronium hexafluorophosphate (HBTU) and its 
tetrafluoroborate analog (TBTU) or its pyrrolidine analog (HBPyU), 
O -(7-azaben20tria2ol-l-yl)-l,l,3,3-tetramethyluronium 
hexafluorophosphate (HATU) and its tetrafluoroborate analog 

35 (TATU) or pyrrolidine analog (HAPyU). The most common catalytic 
additives used in coupling reactions include 4- 
dimethylaminopyridine (DMAP), 3-hydroxy-3,4-dihydro-4-oxo-l,23- 
benzotriazine (HODhbt), N-hydroxybenzotriazole (HOBt) and 1- 
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hydroxy-7-azaben20triazole (HOAt). Each protected amino add is 
used in excess (>2.0 equivalents), and the couplings are usually 
carried out in N-methylpyrrolidone (NMP) or in DMF, CH2CI2 or 
mixtures thereof. The extent of completion of the coupling reaction 
5 can be monitored at each stage, e.g^ by the ninhydrin reaction as 
described by Kaiser et cd., Anal Biochem. 34:595 (1970). In cases 
where incomplete coupling is found, the coupling reaction is 
extended and repeated and may have chaotropic salts added. The 
coupling reactioiis can be performed automaticaUy with 
10 commercially available instruments such as ABI model 430A, 431 A 
and 433A peptide synthesizers. 

After the entire assembly of the desired polypeptide, the 
polypeptide-resin is cleaved with a reagent with proper scavengers. 

1 5 The Fmoc peptides are usually cleaved and deprotected by TFA with 
scavengers (e.g., H2O, ethanedithiol, phenol and thioanisole). The 
tBoc peptides are visually cleaved and deprotected with liquid HF for 
1-2 hours at -5 to 0*C, which cleaves the polypeptide from the resin 
and removes most of the side-chain protecting groups. Scavengers 

20 such as anisole, dimethylsulfide and p-thiocresol are usually used 
with the liquid HF to prevent catioi\s formed during the cleavage 
from alkylating and acylating the amino acid residues present in the 
polypeptide. The formyl group of Trp and dinitrophenyl group of His 
need to be removed, respectively, by piperidine and thiophenol in 

25 DMF prior to the HF cleavage. The acetamidomethyl group of Cys 
can be removed by mercury(II) acetate and alternatively by iodine, 
thalliimi (HI) trifluoroacetate or silver tetrafluoroborate which 
simultai\eously oxidize cysteine to cystine. Other strong adds used 
for tBoc peptide cleavage and deprotection include 

30 trifluoromethanesulfonic add (TFMSA) and 
trimethylsilyltrifluoroacetate (TMSOTf). 

The following examples are induded to illustrate the present 
invention but not to limit it. 

35 
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Example 1 



Production of HCV NS3 Protease 



10 



15 



20 



25 



30 



A. Plasmid constructions. 

Several plasmids were designed and constructed using standard 
recombinant DNA techniques (Sambrook, Fritsch & Maniatis) to express 
the HCV protease in E. coJi (Fig 2-7). All HCV specific sequences 
originated from the parental plasmid pBRTM/HCV 1-3011 (Grakoui et 
al 1993), To express the N-terminal 183 amino add versions of the 
protease, a stop oodon was inserted into the HCV genome using 
synthetic oligonucleotides (Fig. 3). The plasmids designed to express the 
N-terminal 246 amino acid residues were generated by the ruitural Ncol 
restriction site at the C-terminus. 

i) Construction of the plasmid pBJlOlS (Figure 2) 

The plasmid pBRTM/HCV 1-3011 containing the entire HCV genome 
(Grakoui A., et al,}. Virol 67: 1385-1395) was digested with the 
restriction enzymes Sea I and Hpa I and the 7138 bp (base pair) DNA 
fragment was isolated and cloned to the Sma I site of pSP72 (Promega) to 
produce the plasmid, pRJ201, The plasmid pRJ 201 was digested with 
Msc I and the 2106 bp Msc I fragment was isolated and cloned into the 
Sma I site of the plasmid pBD7. The resulting plasmid pMBM48 was 
digested with Kas I and Nco I, and the 734 bp DNA fragment after blimt 
ending with Klenow polymerase was isolated and cloned into Nco I 
digested, klenow polymerase treated pTrc HIS B seq expression plasmid 
(Invitrogen). The ligation regenerated a Nco I site at the 5* end and Nsi I 
site at the 3' end of HCV sequence. The plasmid pTHB HCV NS3 was 
then digested with Nco I and Nsi I, and treated with klenow polymerase 
and T4 DNA polymerase, to produce a blimt ended 738 bp DNA 
fragment which was isolated and cloned into Asp I cut, klenow 
polymerase treated expression plasmid pQE30 (HIV). The resulting 
plasmid pBJ 1015 expresses HCV NS3 (246 amino adds) protease. 
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(ii) Construction of the plasmid pTS 56-9 with a stop codon after 
amino add 183 (Figxire 3) 

The plasmid pTHB HCV NS3 was digested with Nco I, treated 
5 with klenow polymerase, then digested with Bst Y I; and the DNA 
fragment containing HCV sequence was isolated and cloned into Sma I 
and Bgl n digested pSP72. The resulting plasmid pTS 49-27 was then 
digested with Bgl n and Hpa I and ligated with a double stranded 
oligonucleotide: 

10 GA TCA CCG GTC TAG ATCT 

T GGC GAG ATc TAGA (SEQ E) NO 3) to produce pTS 56-9. 
Thus, a stop codon was placed directly at the end of DNA encoding the 
protease catalytic domain of the NS3 protein. This enabled the HCV 
protease to be expressed independently from the helicase domain of the 
15 NS3 protein. 



(iii) Construction of the plasmid pT5 His HTV-NS3 (Figure 4) 

20 The plasmid pTS56-9 was digested with Bgl H, and treated with 

Klenow polymerase to fill in 5' ends. The plasmid was then digested 
with NgoM I and the blimt ended Bgl H/NgoMI fragment containing 
the NS3 sequence was isolated and ligated to the Sgll, Klenow treated 
NgmMI cut and Sal I klenowed pBJ 1015. The resulting plasmid is 

25 designated pTSHis HIV 183. 

l^pfnlHin^ of Insoluble HCV NS3 Protease 

30 

The present example describes a novel process for the refolding of 
HCV NS3 protease which does not have a solubilizing motif from an E, 
coli inclusion body pellet. This procedure can be used to generate 
purified enzyme for activity assays and structural studies. 

35 



Extraction and Purifiration of H i«;-HTV 183 from the E. coli inclusion 

body pellet 
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£. coli cells harboring the plasmid for HisHIV183 was used to 
transform a culture of E. coli strain M15 [pREP] (Qiagen), which over- 
expresses the lac repressor, according to raethods recommended by 
5 commercial source. M15 [pREP] bacteria harboring recombinant 
plasmids were grown overnight in 20-10-5 broth supplemented with 
lOOjig/ml ampidllin and 25p.g/ml kanamycin. Cultures were diluted to 
O.D.600 of 0.1, then grown at 37^ to O.D.600 of 0.6 to 0.8, after which 
IPTG was added to a final concentration of ImM. At post-induction 2 to 

10 3 hours, the cells were harvested by pelleting, and the cell pellets were 
washed with lOOmM Tris, pH 7.5. were pelleted by centrifugation. The 
cell pellet was resuspended in 10 ml of O.IM Tris-HCl, 5mM EDTA, pH 
8.0 (Buffer A) for each gm wet weight of pellet. The pellet was 
homogenized and resuspended using a Doimce homogenizer. The 

1 5 suspension was clarified by centrifugation at 20,000 x g for 30 minutes at 
4°C. The pellet was sequentially washed with the following five buffers: 



1. Buffer A 

20 2, l.OM sodium chloride (NaCl) in buffer A 
3. 1.0% Triton X-100 in buffer A 
4. Buffer A 

5. 1.0 M Guanidine HCl ( GuHCl) in buffer A. 



25 The washed pellet was solubilized with 5M GuHQ, 1% beta 

mercaptoethanol in buffer A (3 ml per gm wet wt. of pellet) 
using a Dounce homogenizer and centrifuged at 100,000 x g for 30 
minutes at 4**C. Purification of denatured HisHIV183 from high 
molecular weight aggregates was accomplished by size exclusion on a 

30 SEPHACRYL S-300 gel filtration column. 

In particular, an 8 ml sample of the 5.0M GuHCl E. coli extract 
was applied to a 160 ml Pharmacia S-300 column (1.6 x 100 cm) at a flow 
rate of 1.0 ml/min. The column buffer was comprised of 5.0 M GuHCl, 
35 0.1 M Tris-HCl, pH 8.0, and 5.0 mM EDTA. The fraction size was 5.0 ml. 
Appropriate fractions were pooled based on the results of SD5-PAGE, as 
well as N-terminal sequence analysis of the protein transferred to a Pro- 
Blot. 
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Detergent-assisted refolding of HCV-protease 

The protein was concentrated by ultrafiltration using a 43 irun 
Amicon YMIO membrane to 1.0 mg per ml in 5M GiiHCl, O.IM Tris-HCl 
pH 8.0, 1.0 mM EDTA, 1.0% beta-mercaptoethanol. It was then diluted 
50-fold to O.IM GuHCl in refolding buffer (100 mM sodium phosphate 
pH 8.0, lOmM DTT, 0.1% lauryl maltoside) and the mixture was 
incubated on ice for at least one hour. A 25 ml sample containing 500 \ig 
of the protein in the refolding buffer was applied to a Pro-RPC HR 3/5 
reversed phase chromatography column. The applied sample contained 
500 ^ig protein in 25 ml of refolding buffer. To the column was then 
applied a solution B comprised of 99*9% H2O + 0.1% trifluoroacetic add 
(TFA). A 10 ml volume of solution C [10% H2O, 90% acetonitrile (AcN) 
+ 0.1% TFA] was applied to the column at a 0 - 60% gradient into 
solution B at a flow rate of 0.5ml/min. and a fraction size of 0.5ml. The 
fractions were monitored at A214; 2.0 absorbance imits full scale (AUFS). 

Fractions contairung the protein (corresponding to peak 1) were 
pooled for renaturation by stepwise dialysis. The fractions were first 
dialysed in 0.1% TFA in 25% glycerol overnight at 4X. These pooled 
fractions had a concentration of 0.1% TFA, 40% acetonitrile and a pH of 
less than 1. The fractions were then dialyzed in 0.01% TFA in 25% 
glycerol overnight at 4^C raising the pH to about 2; then dialyzed in 
0.001% TFA in 25% glycerol for 3.0 hoiu-s raising the pH to about 3; then 
dialyzed for 3 hours at 4^C in 50 mM NaP04, pH 6.0, 10 mM DTT in 25% 
glycerol raising the pH to about 6. The protein was then dialyzed for 3.0 
hours at 4*^C in 50 mM NaP04, pH 7.0, 0.15 M NaCl, 10 mM DTT in 25% 
glycerol; and then finally dialyzed in 50 mM NaP04, pH 7.8, 0.3 M NaCl, 
10 mM DTT, 0.2% Tween 20 in 25% glycerol. This resulted in purified, 
refolded, soluble, active HCV NS3 protease resulting in a solution 
having a pH of about 7.4 - 7.8. 

Far UV circular dichroism (CD) artalysis of the protein was used 
to monitor the refolding from an acid denatured state to a folded state at 
neutral pH. The protein recovery was monitored by a UV scan and SDS- 
PAGE analysis. 
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Detergent-assisted Refolding of His-HIV183 

5 HisHIV183 was quantitatively extracted from an E. coli inclusion 

body pellet. SDS-PAGE analysis at the various stages of extraction shows 
that sequential washes are essential to remove significant amounts of 
the contaminating proteins. HisHIV183 was extracted from the washed 
inclusion body pellet in the presence of 5M GxiHCl. The 5M GuHCl 
1 0 extract was applied to a SEPHACRYL S-300 column and the appropriate 
fractions were pooled based on SDS-PAGE analysis. The aihino acid 
sequence of the first ten residues was verified. 

Refolding was p)erformed at very low concentrations of protein, 
15 in the presence of DTT, lauryl maltoside and glycerol at 4°C. The diluted 
protein was concentrated on a Pro-RPC reversed phase column. Two 
peaks were obtained based on the UV and protein profile. Only Peak 1 
has yielded soluble protein after stepwise dialysis. Far UV CD spectral 
aitalysis was used to monitor refolding from a denatured state at add pH 
20 to a folded state at neutral pH. At pH 7.4, the protein was found to 

exhibit significant amoimts of secondary structure that is consistent with 
that of beta sheet protein. At low pH, the CD spectrum showed that it is 
fully random coil, having a minimal molar elliptidty at 20Qnm. The 
ratio of this minimimi at 200nm to that of the shoulder at 220 nm is 
25 approximately 4:1. This ratio decreased when the secondary structure 
formation occurred it neutral pH. 

A UV scan at each step of dialysis showed that the protein 
recovery was >90% up to pH 7.0 and that there was no light scattering 

30 effect due to protein aggregates. SDS-PAGE analysis also indicated that 
there was no loss of protein up to pH 7.0 during refolding. Predpitation 
of protein occurred at the last step of dialysis, and the soluble protein 
was darified by centrifugation. The overall protein recovery was about 
10%. The refolded protein was found to be active in a trans-cleavage 

35 assay using the in znfro-translated 5A/5B substrate in the presence of 4A 
peptide. 
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Analysis of NS3 Protease Activity By In Vitro Translation Assay 

5 To detect HCV NS3 protease activity in trans, we have expressed a 

40 kD protein contaiiung the NS5A/5B cleavage site in cell-free 
translation system and used that as the substrate for the enzyme. The 
substrate protein produces two protein products of apparent molecular 
weight 12.5 kD (NS 5A') and 27 kD (NS5B*) upon cleavage by the HCV 
10 NS3 protease. 

The plasmid pTS102 encoding the substrate 5A/5B was linearized 
by digestion with EcoR I and was transcribed using T7 RNA polymerase 
in vitro. The RNA was translated in presence of ^^s methionine in 

1 5 rabbit reticulocyte lysates according to the manufacturer's (Promega ) 
protocol to produce HCV specific protein. In a 20 ^il total reaction 
mixture containing lOmM Tris, pH 7.5, ImM DTT, 0.5mM EDTA, and 
1070 glycerol was placed 2 to 8 |il of methionine-labeled translated 
5A/5B substrate. The reaction was started with the addition of 10^1 of 

20 HCV NS3 protease (SEQ ID NO: 2) with an approximately equimolar 
amount (2 ^M) of the carboxyterminal 33 mer cofactor NS4A (SEQ ID 
NO: 7) in solubilization buffer (50mM Na Phosphate, pH 7.8, 0.3M NaCl, 
0.2% Tween 20, 10 mM DTT or BME, 10% glycerol), and incubated at 
30°C for about one hour. Reactions were stopped by adding an equal 

25 volume of 2X Laemmli sample buffer (Enprotech Inc.) and heating at 
100°C for 3 minutes. Reaction products were separated by SDS PAGE 
electrophoresis; gels were fixed, dried and subjected to autoradiography. 

The assay was able to cleave 5A/5B substrate in a dose responsive 
30 manner, producing the expected cleaved products: 5A (12.5 kD) and 5B 
(27 kD) as shown by SDS PAGE analysis. The production of cleaved 5A 
and 5B polypeptides from the 5A/5B substrate is proof that soluble, 
active, refolded HCV protease was indeed produced by the process of the 
present invention. 

35 
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DetergenNAssisted Refolding of the Catalyric Domain. His^HIV 183 
HCV protease catalytic domain has been expressed in high 
5 concentrations in Exoli as an inclusion body pellet, and is amenable to 
refolding studies. His-HrVNS3183 contains a six-residue polyhistidine 
tag, a 27 residue HIV protease cleavage sequence and a serine protease 
domain of 183 amino acids. His-HIVNS3 183 was extracted with 5M 
GuHCL according to the procedure of Example 2. A sample of the 5.0M 
1 0 GuHCl E. coH extract was applied to a 500 ml Pharmacia S-300 column 
(5.0 x 100 cm) at a flow rate of 4ml/min. The column buffer was the 
same buffer used in Example 2. About lOOmg of highly purified protein 
was obtained for refolding studies. 

15 The fractions containing the protein (1.0 mg / ml) were collected 

and diluted 50-fold in buffer A (100 mM sodium phosphate pH 7.8, 25% 
glycerol, 0.1% lauryl maltoside and 10 mM DTT) and immediately 
applied to a POROS 20R1 reversed phase column. A main peak and a 
shoulder were eluted with a 0-60% acetonitrile gradient in 0,1 %TFA. 

20 Only the main peak, not the shoulder, yielded active protease using a 
stepwise dialysis procedure. 

Fractions containing the protein (corresponding to peak 1) were 
pooled for renaturation by stepwise dialysis. The fractions were first 

25 dialysed in 0.1% TEA in 25% glycerol overnight at 4°C. These pooled 
fractions had a concentration of 0.1% TEA, 40% acetonitrile and a pH of 
less than 1. The fractions were then dialyzed in 0.01% TEA in 25% 
glycerol overnight at 4°C raising the pH to about 2; then dialyzed in 
0.001% TEA in 25% glycerol for 3.0 hours raising the pH to about 3; then 

30 dialyzed for 3 hours at 4''C in 50 mM NaP04, pH 6.0, 10 mM DTT in 25% 
glycerol raising the pH to about 6. The protein was then dialyzed for 3.0 
hours at 4^C in 50 mM NaP04, pH 7.0, 0.15 M NaCl, 10 mM DTT in 25% 
glycerol; and then finally dialyzed in 50 mM NaP04, pH 7.8, 0.3 M NaCl, 
10 mM DTT, in 25% glycerol. This resulted in purified, refolded, 

35 soluble, active HCV NS3 protease resulting in a solution having a pH of 
about 7.4 - 7.8. This resulted in an approximate 27% yield of active 
protease (>95% purity). 
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The refolded protein was found be active in the presence of a 
N54A peptide (33-mer) in the in i;rtro-translation assay using a 
truncated 5A-5B substrate. Three small scale refolding experiments (1.0 
& 10.0 and 20 mg ) gave reproducible yields (30%) of active soluble 
5 protease. We performed a loading study on the reverse phase column to 
improve the recovery of refolded protein. A 2.5 mg scale refolding gave 
27% recovery of active protease. Refolding of HCV protease from a one 
liter fermentation is estimated to give 4-5 mg of active protein. 



10 We have studied the enhancement activity of NS4A peptides on the 
activity of refolded HCV protease in the SPA assay. Kinetics of this 
enzyme has been determined with the urilabeled peptide in the HPLC 
assay. (Table) 



15 

Table Kinetics of Refolded HCV Protease Catalytic Domain 
Determined in the presence of NS4A (22-54) 



20 Non^Linear Regression 
Km= 63.626+/- 19.834 nM 

Vmax= 22.9 +/- 3.397 pmoles / min / 0.5pig enzyme 
kcat = 105 min-^ 
25 kcat/ Km = 264.7 M'V^ 



preliminary Deterge nt-Assistcd Refolding of NS3 631 

30 The full-length HCV protease NS3 631 was extracted from an 

Exoli inclusion body pellet and purified using Sephacryl S-300 
chromatography. Forty milligrams of highly purifed NS3 631 has been 
obtained from a six liter fermentation. This protein migrated as a 
doublet on SDS-PAGE under reducing conditions. N-terminal 

35 sequencing of the two immunoreactive bands indicated that the 

majority of the protein has a blocked N-terminus. The biochemical basis 
for the heterogeneity is unknown. Using modified detergent-assisted 
refolding scheme that was described for HisHIV183, low amounts of 
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soluble protein was obtained. The procedure was modified by including 
0.5M cirginine hydrochloride in tixe refolding buffer. The refolded 
protein showed activity in the presence of NS4A peptide in the in vitro- 
translation assay using truncated 5A-5B as a substrate. 

5 



10 



15 



20 



25 



30 



35 
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SEQUENCE USTING 



5 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Schering Corporation 

10 

(ii) TITLE OF INVENTION: Method for Refolding Insoluble 
Aggregates of Hepatitis C Virus Protease 

(iii) NUMBER OF SEQUENCES: 9 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Schering-Plough Corporation 

(B) STREET: 2000 Galloping Hill Road 

(C) CITY: Kenilworth 
20 (D) STATE: New Jersey 

(E) COUNTRY: USA 

(F) ZIP: 07033-0530 

(v) COMPUTER READABLE FORM: 
25 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: Apple Macintosh 

(C) OPERATING SYSTEM: Macintosh 7.1 

(D) SOFTWARE: Microsoft Word 5.1a 

30 (vi) CURRENT APPLICATION DATA: 

(A) APPUCATION NUMBER: 

(B) FILING DATE: 

(C) CLASSfflCATION: 

35 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER; 08/439,680 

(B) FILING DATE: May 12, 1995 
(vii) PRIOR APPUCATION DATA: 
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(A) APPUCATION NUMBER 08/571,643 

(B) FILING DATE: December 13, 1995 

(viii) ATTORNEY/ AGENT INFORMATION: 
5 (A) NAME: Lvmn, Paul G. 

(B) REGISTRATION NUMBER: 32,743 

(C) REFERENCE/DOCKET NUMBER: JB0508K 

(ix) TELECOMMUNICATION INFORMATION: 
1 0 (A) TELEPHONE: 908-298-5061 

(B) TELEFAX: 908-298-5388 

(2) INFORMATION FOR SEQ ID NO:l: 

1 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: HCV NS3 Protease 

25 



CCG CCC ATC ACG GCG TAG GCC CAG CAG ACG AGA GGC CTC CTA GGG 45 
Ala Pro lie Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu Gly 
30 1 5 10 15 

TCT ATA ATC ACC AGO CTG ACT GGC CGG GAC AAA AAC CAA GTG GAG 90 

Cys lie He Thr Ser Leu Thr Gly Arg Asp Lys Asn Gin Val Glu 
20 . 25 30 

35 

GGT GAG GTC CAG ATC GTG TCA ACT GCT ACC CAA ACC TTC CTG GCA 135 

Gly Glu Val Gin He Val Ser Thr Ala Thr Gin Thr Phe Leu Ala 
35 40 45 
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ACG TGC ATC AAT GGG GTA TGC TGG ACT GTC TAC CAC GGG GCC GGA 180 
Thr Cys He Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly 
50 55 60 

5 

ACG AGG ACC ATC GCA TCA GCC AAG GGT OCT GTC ATC CAG ATG TAT 225 
Thr Arg Thr He Ala Ser Pro Lys Gly Pro Val He Gin Met Tyr 
65 70 75 

10 ACC AAT GTG GAC CAA GAC CTT GTG GGC TGG CCC GCT CCT CAA GGT 270 
Thr Asn Val Asp Gin Asp Leu Val Gly Trp Pro Ala Pro Gin Gly 
80 85 90 

TCC CGC TCA TTG ACA CCC TGC ACC TGC GGC TCC TCG GAC CTT TAC 315 
15 Ser Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr 

95 100 105 

CTG GTT ACG AGG CAC GCC GAC GTC ATT CCC GTG CGC CGG CGA GGT 360 
Leu Val Thr Arg His Ala Asp Val He Pro Val Arg Arg Arg Gly 
20 110 115 120 

GAT AGC AGG GGT AGC CTG CTT TCG CCC CGG CCC ATT TCC TAC CTA 405 
Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro He Ser Tyr Leu 
125. 130 135 

25 

AAA GGC TCC TCG GGG GGT CCG CTG TTG TGC CCC GCG GGA CAC GCC 450 
Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala 
140 145 150 



30 GTG GGC CTA TTC AGG GCC GCG GTG 
Val Gly Leu Phe Arg Ala Ala Val 
155 

GCG GTG GAC TTT ATC CCT GTG GAG 
35 Ala Val Asp Phe He Pro Val Glu 

170 



TGC ACC CGT GGA GTG ACC AAG 495 
Cys Thr Arg Gly Val Thr Lys 
160 165 

AAC CTA GAG ACA ACC ATG AG A 540 
Asn Leu Glu Thr Thr Met Arg 
175 180 



TCC CCG GTG 
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Ser Pro Val 



(2) INFORMATION FOR SEQ ID N0:2: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
1 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 
15 (A) NAME/KEY: pT5His/HIV/183 



ATG AGA GGA TCG CAT CAC CAT CAC CAT CAC GGA TCC CAT AAG GCA 45 
Met Arg Gly Ser His His His His His His Gly Ser His Lys Ala 
15 10 15 



20 



AGA GTT TTG GCT GAA GCA ATG AGC CAT GGT ACC ATG GCG CCC ATC 90 
Arg Val Leu Ala Glu Ala Met Ser His Gly Thr Met Ala Pro He 
20 25 30 

25 ACG GCG TAC GCC CAG CAG ACG AGA GGC CTC CTA GGG TGT ATA ATC 135 
Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu Gly Cys He He 
35 40 45 

ACC AGC CTG ACT GGCT CGG GAC AAA AAC CAA GTG GAG GGT GAG GTC 180 
30 Thr Ser Leu Thr Gly Arg Asp Lys Asn Gin Val Glu Gly Glu Val 

50 55 60 

CAG ATC GTG TCA ACT GCT ACC CAA ACC TTC CTG GCA ACG TGC ATC 225 
Gin He Val Ser Thr Ala Thr Gin Thr Phe Leu Ala Thr Cys He 
35 ' 65 70 75 

AAT GGG GTA TGC TGG ACT GTC TAC CAC GGG GCC GGA ACG AGG ACC 270 
Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Thr Arg Thr 
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80 85 90 



ATC GCA TCA CCC AAG GGT CCT GTC ATC CAG ATG TAT ACC AAT GTG 315 
5 He Ala Ser Pro Lys Gly Pro Val He Gin Met Tyr Thr Asn Val 

95 100 105 

GAC CAA GAC CTT GTG GGC TGG CCG GCT CCT CAA GGT TCC CGC TCA 360 
Asp Gin Asp Leu Val Gly Trp Pro Ala Pro Gin Gly Ser Arg Ser 
10 110 115 120 

TTG AC A CCC TGC ACC TGC GGC TCC TCG GAC CTT TAC CTG GTT ACG 405 

Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr 
125 130 135 

15 

AGG CAC GCC GAC GTC ATT CCC GTG CGC CGG CGA GGT GAT AGC AGG 450 

Arg His Ala Asp Val He Pro Val Arg iKrg Arg Gly Asp Ser Arg 
140 145 150 

20 GGT AGC CTG CTT TCG CCC CGG CCC ATT TCC TAC CTA AAA GGC TCC 495 
Gly Ser Leu Leu Ser Pro Arg Pro He Ser Tyr Leu Lys Gly Ser 
155 160 165 

TCG GGG GGT CCG CTG TTG TGC CCC GCG GGA CAC GCC GTG GGC CTA 540 
25 Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala Val Gly Leu 

170 175 180 

TTC AGG GCC GCG GTG TGC ACC CGT GGA GTG ACC AAG GCG GTG GAC 585 
Phe Arg Ala Ala Val Cys Thr Arg Gly Val Thr Lys Ala Val Asp 
30 185 190 195 

TTT ATC CCT GTG GAG AAC CTA GAG ACA ACC ATG AGA TCC CCG GTG 630 

Phe He Pro Val Glu Asn Leu Glu Thr Thr Met Arg Ser Pro Val 
200' 205 210 

35 



(2) INFORMATION FOR SEQ ID NO:3: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: double 
5 P) TOPOLOGY: double 

(ii) MOLECULE TYPE: cDNA 

10 GA TCA CCG GTC TAG ATCT 

T GGC CAG ATC TAGA 

2) INFORMATION FOR SEQ ID NO:4: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino adds 

(B) TYPE: amino add 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 
25 (A) NAME/KEY: histidine tag 



Met Arg Gly Ser His His His His His His Thr Asp Pro 
5 10 

30 

(2) INFORMATION FOR SEQ ID NOS: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 amino adds 
35 (B) TYPE: amino add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 

(A) NAME/BCEY: Mutant Soluble 5A/5B Substrate 

5 

Asp Thr Glu Asp Val Val Ala Cys Ser Met Ser Tyr Thr Trp Thr 
5 10 15 



Gly Lys 



10 



(2) INFORMATION FOR SEQ ID NO:6: 



(i) SEQUENCE CHARACTERISTICS: 
1 5 (A) LENGTH: 162 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Unear 

20 (ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: Native NS4A 

TCA ACA TGG GTG CTC GTT GGC GGC GTC CTG GCT GOT CTG GCC GCG 45 
25 Ser Thr Trp Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala 
1 5 10 15 

TAT TGC CTG TCA AO GGC TGC GTG GTC ATA GTG GGC AGG ATT GTC 90 
Tyr Cys Leu Ser Thr Gly Cys Val Val lie Val Gly Arg He Val 
30 20 25 30 

TTG TCC GGG AAG CCG GCA ATT ATA CCT GAC AGG GAG GTT CTC TAC 135 
Leu Ser Gly Lys Pro Ala He He Pro Asp Arg Glu Val Leu Tyr 
35 40 45 



35 



CAG GAG TTC GAT GAG ATG GAA GAG TGC 
Gin Glu Phe Asp Glu Met Glu Glu Cys 
50 
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(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 33 amino add residues 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Unear 

1 0 (ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 

(A) NAME/KEY: Carboxl 33 mer of NS4A 



15 Cys Val Val He Val Gly Arg He Val Leu Ser Gly Lys Pro Ala 

5 10 15 



20 



He He Pro Asp Ar^ Glu Val Leu Tyr Gin Glu Phe Asp Glu Met 

20 25 30 

Glu Glu Cye 



(2) INFORMATION FOR SEQ ID NaS: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 33 amino add residues 

(B) TYPE: nudeic add 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: Unear 

(u) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 

35 (A) NAME/KEY: Carboxl 33 mer of NS4A of HCV-BK strain 



Ser Val Val He Val Gly Arg He He Leu Ser Gly Arg Pro Ala 

5 10 15 
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Ile Val Pro Asp Arg Glu Leu Leu Tyr Gin Glu Phe Asp Glu Met 
20 25 30 

5 Glu Glu Cys 

^ (2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino add residues 

(B) TYPE: amino add 

(C) STRANDEDNESS: single 
P) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 

(A) NAME/KEY: Soluble 4B/5A Substate 

20 Trp He Ser Ser Glu Cys Thr Thr Pro Cys Ser Gly Ser Trp Leu 

5 10 15 

Arg Asp He Trp Asp 
20 

25 



30 



10 



15 
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5 WE CLAIM: 

1. A process for producmg soluble, protealytically active, refolded HCV 
protease from insoluble, bacterially produced HCV protease aggregates 
comprising: 

10 

(a) extracting insoluble, aggregates of HCV NS3 protease from 
bacteria producing said aggregates; 

(b) solubilizing the aggregates of protease in a buffer contairung a 
1 5 denaturing reagent; 

(c) placing solubilized protease from step (b) in a buffer containing 
a reducing agent said buffer having an acidic pH; 

20 (d) removing the denaturing reagent from the buffer under 

conditions v^herein the buffer maintains an acidic pH; and 

(e) raising the pH of the buffer containing protease in a stepwise 
manner to a pH of about 7 - 8 so as to produce properly refolded soluble, 
25 active NS3 protease. 

2. The process of claim 1 wherein the denaturing agent is guanidine 
hydrochloride (GuHCl). 

30 3, The process of claim 2 wherein the solution of GuHCl contains 
GuHQ at a concentration of about 5M. 

4. The process of claim 1 wherein the reducing agent is dithiothreitol or 
p-mercaptoethanol. 

35 

5. The process of claim 1 wherein the buffer containing the reducing 
agent also contains a non-ionic detergent. 
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6. The process of claim 5 wherein the non-ionic detergent is selected 
from the group cor\sisting of lauryl maltoside, a polyoxyethylene ether , 
nonidet P-40, a polyoxyethylene 9 -lauryl ether such as, (3-[(3- 
cholamidopropyl)-dimethylammonio]-l-propanesulfonate) (CHAPS), 

5 and octylglucoside. 

7. The process of claim 6 wherein the solubilized protease of step (c) is in 
a 5M GuHCl solution and wherein the protease is reduced by diluting 
the 5M GuHCl solution with a buffer contaming about lOmM DTT and 

10 0.1% lauryl maltoside. 

8. The process of claim 1 wherein the denaturing reagent is removed in 
step (c) by applying the fractions contairung the protease to a reverse 
phase chromatography column tmder conditions wherein fractions 

1 5 collected have an acidic pH. 

9. The process of claim 8 wherein after the buffer containing the 
protease of step (c) is applied to the reverse phase chromatography 
column, a solution containing 99.9% H2O and 0.17o triflouroacetic acid 

20 (TFA) is added to the column. 

10. The process of claim 8 further comprising after adding the solution 
of 99.9% H2O + 0.1% TFA adding a solution comprised of 10% H2O + 
90% acetonitrile + 0.1% TFA to the column at a 0 - 60% gradient into the 

25 solution of 99.9% H2O + 0.1% TFA and collecting the fractions. 

11. The process of claim 10 further comprising dialyzing the fractions 
containing the properly refolded protein of step (c) first in an aqueous 
solution of 0.1% TFA resulting in a solution having a pH less than 1, 

30 then dialyzing the fraction in an aqueous solution of 0.01% TFA 

resulting in a solution having a pH of about 2 and then dialyzing the 
fractions in 0.001% TFA resulting in a solution having a pH of about 3, 
then dialyzing the solution in an aqueous solution having a pH of about 
6, then dialysing the solution in an aqueous solution having a pH of 

35 about 7, then dialysing the solution in an aqueous solution having a pH 
of about 7.8 resulting in a solution having a pH of 7.4 - 7.8 containing 
properly refolded active HCV NS3 protease. 
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12. The process of claim 8 wherein after the buffer containing the 
protease of step (c) is applied to the reverse phase chromatography 
column,the column is eluted with a 0% - 60% acetonitrile gradient. 
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